On a Connection between Object Localization with a Generative Template of Features and Pose-space Prediction Methods
نویسندگان
چکیده
We address the task of localizing objects from a given object class in an image. The image is represented as a collection of “visual words” at interest points. The generative template of features (GTF) model defines a distribution over visual words and their spatial locations for each part of the object (Sudderth et al., 2005; Fergus et al., 2005). We show how to derive pose-space prediction methods (such as the Hough transform) from the GTF.
منابع مشابه
استفاده از برآورد حالتهای پویای دست مبتنی بر مدل، برای تقلید عملکرد بازوی انسان توسط ربات با دادههای کینکت
Pose estimation is a process to identify how a human body and/or individual limbs are configured in a given scene. Hand pose estimation is an important research topic which has a variety of applications in human-computer interaction (HCI) scenarios, such as gesture recognition, animation synthesis and robot control. However, capturing the hand motion is quite a challenging task due to its high ...
متن کاملModeling Pose/Appearance Relations for Improved Object Localization and Pose Estimation in 2D images
We propose a multiview model of appearance of objects that explicitly represents their variations of appearance with respect to their 3D pose. This results in a probabilistic, generative model capable of precisely synthesizing novel views of the learned object in arbitrary poses, not limited to the discrete set of trained viewpoints. We show how to use this model on the task of localization and...
متن کاملEfficient Unsupervised Learning for Localization and Detection in Object Categories
We describe a novel method for learning templates for recognition and localization of objects drawn from categories. A generative model represents the configuration of multiple object parts with respect to an object coordinate system; these parts in turn generate image features. The complexity of the model in the number of features is low, meaning our model is much more efficient to train than ...
متن کامل3D Models Recognition in Fourier Domain Using Compression of the Spherical Mesh up to the Models Surface
Representing 3D models in diverse fields have automatically paved the way of storing, indexing, classifying, and retrieving 3D objects. Classification and retrieval of 3D models demand that the 3D models represent in a way to capture the local and global shape specifications of the object. This requires establishing a 3D descriptor or signature that summarizes the pivotal shape properties of th...
متن کاملPlanar PØP: Feature-less pose estimation with applications in UAV localization
We present a featureless pose estimation method that, in contrast to current Perspective-n-Point (PnP) approaches, it does not require n point correspondences to obtain the camera pose, allowing for pose estimation from natural shapes that do not necessarily have distinguished features like corners or intersecting edges. Instead of using n correspondences (e.g. extracted with a feature detector...
متن کامل